Search results for "De Bruijn sequence"

showing 4 items of 4 documents

On shortening u-cycles and u-words for permutations

2017

Abstract This paper initiates the study of shortening universal cycles (u-cycles) and universal words (u-words) for permutations either by using incomparable elements, or by using non-deterministic symbols. The latter approach is similar in nature to the recent relevant studies for the de Bruijn sequences. A particular result we obtain in this paper is that u-words for n -permutations exist of lengths n ! + ( 1 − k ) ( n − 1 ) for k = 0 , 1 , … , ( n − 2 ) ! .

QA75De Bruijn sequenceApplied Mathematics0211 other engineering and technologies021107 urban & regional planning0102 computer and information sciences02 engineering and technology01 natural sciencesCombinatorics010201 computation theory & mathematicsFOS: MathematicsDiscrete Mathematics and CombinatoricsMathematics - CombinatoricsCombinatorics (math.CO)Mathematics

researchProduct

Variable-order reference-free variant discovery with the Burrows-Wheeler Transform

2020

Abstract Background In [Prezza et al., AMB 2019], a new reference-free and alignment-free framework for the detection of SNPs was suggested and tested. The framework, based on the Burrows-Wheeler Transform (BWT), significantly improves sensitivity and precision of previous de Bruijn graphs based tools by overcoming several of their limitations, namely: (i) the need to establish a fixed value, usually small, for the order k, (ii) the loss of important information such as k-mer coverage and adjacency of k-mers within the same read, and (iii) bad performance in repeated regions longer than k bases. The preliminary tool, however, was able to identify only SNPs and it was too slow and memory con…

Burrows–Wheeler transformComputer science[SDV]Life Sciences [q-bio]Value (computer science)SNPAssembly-free0102 computer and information scienceslcsh:Computer applications to medicine. Medical informatics01 natural sciencesBiochemistryPolymorphism Single Nucleotide03 medical and health sciencesBWTChromosome (genetic algorithm)Structural BiologyHumansSensitivity (control systems)Molecular Biologylcsh:QH301-705.5Alignment-free; Assembly-free; BWT; INDEL; SNP030304 developmental biologyAlignment-free; Assembly-free; BWT; INDEL; SNP;De Bruijn sequence0303 health sciencesSettore INF/01 - InformaticaAlignment-freeApplied MathematicsResearchGenomicsSequence Analysis DNAINDELData structureGraphComputer Science ApplicationsVariable (computer science)lcsh:Biology (General)010201 computation theory & mathematicsAdjacency listlcsh:R858-859.7Suffix[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]AlgorithmAlgorithmsBMC Bioinformatics

researchProduct

Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut

2013

Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…

Taxonomic classificationComputational biologyBiologyGenomeContig MappingContig MappingUser-Computer Interface03 medical and health sciencesAnnotationDatabases GeneticGeneticsRefSeqCluster AnalysisHumansComputer SimulationTaxonomic rank030304 developmental biologyDe Bruijn sequenceInternetPrincipal Component Analysis0303 health sciencesBacteriaContigChimera identification030306 microbiologyComputational BiologyFunctional annotationViral metagenomeIntestinesAssembler performanceMetagenomicsVirusesMetagenomicsAlgorithmsResearch ArticleBiotechnologyBMC Genomics

researchProduct

Parallelized short read assembly of large genomes using de Bruijn graphs

2011

Abstract Background Next-generation sequencing technologies have given rise to the explosive increase in DNA sequencing throughput, and have promoted the recent development of de novo short read assemblers. However, existing assemblers require high execution times and a large amount of compute resources to assemble large genomes from quantities of short reads. Results We present PASHA, a parallelized short read assembler using de Bruijn graphs, which takes advantage of hybrid computing architectures consisting of both shared-memory multi-core CPUs and distributed-memory compute clusters to gain efficiency and scalability. Evaluation using three small-scale real paired-end datasets shows tha…

Hybrid genome assemblyParallel computingComputational biologyBiologylcsh:Computer applications to medicine. Medical informaticsBiochemistryAssemblersStructural BiologyHumansThroughput (business)Molecular Biologylcsh:QH301-705.5De Bruijn sequenceGenomeContigBacteriaGenome HumanApplied MathematicsMessage passingDNA sequencing theoryComputational BiologyHigh-Throughput Nucleotide SequencingComputer Science Applicationslcsh:Biology (General)comic_booksScalabilitylcsh:R858-859.7comic_books.characterSoftwareResearch ArticleBMC Bioinformatics

researchProduct